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Abstract — This work characterizes an important solution con- 
cept of a relevant spectrum game. Two energy-efficient sources 
communicating with their respective destination compete for an 
extra channel brought by a relay charging the used bandwidth 
through a pricing mechanism. This game is shown to possess 
a unique Nash bargaining solution, exploiting a time-sharing 
argument. This Pareto-efficient solution can be implemented 
by using a distributed optimization algorithm for which each 
transmitter uses a simple gradient-type algorithm and alternately 
updates its spectrum sharing policy. Typical numerical results 
show to what extent spectral efficiency can be improved in a 
system involving selfish energy-efficient sources. 

Keywords: resource allocation, cognitive radio, cooperative 
transmission, Nash bargaining solution. 

I. Introduction 

Designing spectrally efficient communication systems has 
always been, and still is, a critical issue in wireless networks. 
The need for energy-efficient terminals at both the mobile and 
fixed infrastructure sides is more recent but becomes stronger 
and stronger This paper precisely considers both aspects. 
More specifically, the main goal is to determine an energy- 
efficient operating point of a given distributed communication 
system at which the spectrum is efficiently used. We do 
not pretend to solve this tough issue for general distributed 
multiuser channels. Rather, we show that it is possible to fully 
determine such an operating point in one possible scenario 
which has already been considered in the literature [1]. This 
scenario is as follows. We consider an initial communication 
system comprising two point-to-point communications which 
use orthogonal channels (say in the frequency domain); a half- 
duplex relay using a dedicated band is added to the system in 
order to help the two transmitters to improve their energy- 
efficiency; the relay implements a pricing mechanism which 
is directly related to the amount of band used for relaying. 
The energy-efficiency metric under consideration is a quantity 
in bit correctly decoded per Joule and is defined as in |2 1. The 
situation where each transmitter aims at selfishly maximizing 
its individual energy-efficiency (with pricing) by allocating 
bandwidth on the extra channel on which the relay operates 
has been considered in [1| ; the solution concept considered 
therein is the Nash equilibrium (NE), which is shown to 
be unique but not Pareto efficient. Our goal is to consider 



another solution concept for this spectrum allocation game of 
interest namely, the Nash bargaining solution (NBS), moti- 
vated by the need to design efficient solutions in distributed 
wireless networks. Remarkably, such a solution exists for the 
considered scenario, is unique, and can be implemented in 
a decentralized manner according to the conjugate gradient 
algorithm. This confirms the relevance of this approach which 
has also been adopted in other contexts such as |3 | (NBS for 
power allocation games where transmission rates are optimized 
with no relay and pricing), ||4] (wireless sensors are energy - 
efficiently coordinated by the Raiffa-Kalai-Smorodinsky so- 
lution to communicate with a unique fusion center), or [5\ 
(multiple access channels without pricing are considered). 
Compared to these references, the present work makes a step 
towards implementing an efficient solution in a decentralized 
manner, which is known to be a challenging task Il6l|l7l. The 
algorithm proposed in this paper is decentralized in the sense 
of the decision but not in terms of channel state information 
(CSI), leaving this issue as a non-trivial extension of this work. 
This paper is structured as follows. In the next section, we 
introduce the system model as well as the spectrum allocation 
game. In section Hill we analyze the NBS and present the 
decentralized algorithm. In section |IV] numerical results are 
presented and discussed. Concluding remarks are proposed in 
section [V] 

II. Problem statement 

A. System model 

The communication system under study is represented in 
Fig. [U Source/transmitter i e {1,2} sends a signal y/piXi 
with power pi over two quasi-static (block fading) links: the 
link from source i to destination i whose channel gain is 
hii G C and the one from the source i to the relay r whose 
channel gain is hir E C. The total band associated with 
those two links is uj and the extra band allocated by source 
i to communicate with the relay is denoted by uji e [0,w]. 
The extra band available is precisely that offered by the relay. 
The relay operates in a half-duplex mode and is assumed to 
implement an amplify-and-forward (AF) protocol. Time is 
divided into blocks on which all channel gains are assumed to 
be fixed. Each block is divided into two sub-blocks [S]. Over 



the first sub-block (first phase), only the source can transmit 
and the signals received by the destination and relay nodes are 
given by: 



(1) 



where nu and (complex) additive white Gaussian 

noises (AWGN) with mean and variance cr'^. Following the 
relevant choice of JS), only the relay is assumed to transmit 
over the second sub-block (second phase): 
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where hri G C is the channel gain between the relay and 
destination i, jiri ^ JV{0, cr^), and Xri is the transmitted signal 
from the relay to destination i and, under the assumption made 
in terms of relaying protocol, expresses as: 

Vir 



(3) 



In the first transmission phase, the signal-to-noise ratio (SNR) 
associated with the source-destination channel i merely writes 
as: 

Pi\hu\'^ 



(4) 



The SNR associated with the second transmission phase is 
given by [IJ, L8J: 
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Interestingly, as proven by (jSj, when using maximal-ratio com- 
bining, the equivalent SNR corresponding to the AF protocol 
can be written in a simple form if the outage probability is 



the metric of interest. This writes as: 7, 
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(a) Direct transmission. 
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(b) Cooperative transmission. 

Figure 1: System model. 



B. Strategic form of the spectrum allocation game 

As motivated in [l], the spectrum allocation problem can 
be modeled by a strategic form game (see e.g., ff\). 

Definition 1: The game is defined by the ordered triplet 

g = {JC, {Si)ieK, {ui)ieK) where 

• /C is the set of players. Here, the players of the game are 
the two sources/transmitters, K, — {1,2}; 

• Si is the set of actions/strategies. Here, the strategy of 
source/transmitter i consists in choosing uji in its strategy 
set Si = [0, uj]\ 

• Ui is the utility function of each user It is given by: 



Ui(uJl,UJ2) 
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where a defines the spectral efficiency (in bit/s per Hz), / : 
[0, +00) — 7> [0, 1] is a sigmoidal efficiency function which can 
correspond to the packet success rate or probability of having 
no outage (emphasizing the link between energy-efficiency and 
the outage analysis conducted in (|8l). The parameter 5 is a 
(linear) pricing factor. 

As explained in the presence of the factor 6 > amounts 
to imposing a cost to the sources for using the relay; this cost is 
assumed to be proportional to the relaying band used. The first 
term of the utility function corresponds to the ratio between 
the goodput (net rate in bit/s) to the cost in terms of power 
(in J/s) for the direct link alone (whose band width equals 
uj—bJi), whereas the second term corresponds to the aggregated 
effects of the direct transmission and the relayed transmission 
(whose bandwidth equals uJi). This game is concave in the 
sense of Rosen and has a unique pure NE (see e.g., Q). 
The main problem is that the NE can be very inefficient as 
there exist some operating points at which both transmitters 
have better utilities. This motivates the study of more efficient 
solutions such as the NBS. The NBS analysis, the design of 
a simple distributed optimization algorithm to implement it, 
and proving its relevance in terms of performance constitute 
the main results of this paper. 

III. Nash bargaining solution analysis 

The objective of this section is to characterize the NBS of 
the game Q and to propose a simple distributed optimization 
algorithm for implementation since the function of interest to 
optimize can be checked to be strictly concave under certain 
operation conditions expUcated in Sec. IIII-DI 

A. Achievable utility region: Pareto boundary and convexity 

First, we study the properties of the achievable or feasible 
utility region, which is denoted by TZ. It is defined as the 
region formed by all the points whose coordinates are (ui, U2) 
that is: 



Tl = {(mi,U2) I {ll>i,uj2) e [0,0;] }. 



(7) 



For a given channel configuration or block of data (i.e., the hij 
are given), the region TZ is compact |3|, which follows from 
the compactness of Si and the continuity of Ui. However, it is 
not always convex. This prevents one from using bargaining 
theory which is based on the convexity of the achievable utility 
region. It turns out that, in the problem under consideration, it 
is relevant to exploit time-sharing (as done in Q for Shannon- 
rate efficient allocation games on the interference channel), 
which convexifies the utility region. Indeed, the main idea 
is to assume that coordination in time is part of the sought 
solution. The new utility region is: 



Proof: The point 
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) defines a threat point 



TZ ={{lJ,Ui + (1 - ^)u[,^U2 + (1 - Ai)w2) 

|0 < M < 1, iui,u2) e n, (u'i,M2) e 7^}. 



(8) 



During a fraction fi of the time, the users use {lui,uj2) to have 
(wi, M2). During a fraction (1— /i) of the time, they use another 
combination of bandwidths (ojJ^jWj) to obtain {u[,U2). Note 
that this region includes several points of interest. First, it 
includes {u^^ , ^2^^) which is the point corresponding to the 
unique pure NE of Q. Second, it includes the two points for 
which the sources or transmitters do not exploit the relay at 
all: wi = or W2 = 0. Let TZ* be the Pareto boundary of the 
convex hull TZ. Fig. |2] illustrates different operating points as 
well as the achievable utility region TZ and the Pareto boundary 
TZ* . The other elements shown in this figure are defined next. 
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Figure 2: The achievable utility region plus some key 
operating points. 



B. Existence and uniqueness analysis of the NBS 

The NBS can be characterized as follows: 
Proposition I: In the spectrum allocation game Q, there 
exists a unique NBS given by: 

max_ {u,-u^^)iu2-u^''), (9) 



(ui,«2)e7?,+ 



where 



and can always be reached, which ensures the existence of 
a solution to the above maximization problem. 

Regarding to the uniqueness of the NBS, Nash proved that 
it holds under certain axioms due to the existence of the 
convex hull of the achievable region and the threat point, as 
mentioned in |3|. As we have shown that the utility region 
can be convexified, this solution is also unique. 

Finally, the cooperative outcome (NBS) must be invariant 
to equivalent utility representations, symmetric, independent 
of irrelevant alternatives and Pareto efficient f9l. The NBS is 
therefore the unique solution resulting from the intersection of 
the Pareto boundary TZ* with the Nash curve which is defined 
as (Fig. |2]i: 



(ui,U2) = arg max 7r(Mi,-U2) 

(f 1 ,M2) 



(11) 



where 7r(Mi, ^2) — (wi — Wi )(w2 



,NE\ 



is the Nash product 



function. ■ 
Since the NBS determination is on the subregion TZ^, we 
stress that the utilities arising from the NBS are higher than 
those deduced with NE (see Fig. |2]). 

C. Decentralized algorithm for the NBS determination 

The proposed algorithm is based on the idea of determining 
analytically the unique maximum, which is the NBS from the 
resolution of the following system of equations: 



(I) 



doji 
duj2 



0, 



= 0. 



(12) 



TZ+ = {{UI,U2) \UI > U2 > u^""} 



(10) 



Mathematical resolution of such a system leads to solve 
two second degree polynomials in uji (for i e {1,2}), the 
discriminants of which are fourth degree polynomials in LUj 
(for j g {1, 2} \ {i}). The study of signs of the discriminants 
show that expressing the NBS analytically is a difficult task 
even by exploiting Ferrari and Cardan methods for high 
degree polynomials resolution. This study shows the interest 
in: (1) finding decentralized algorithms to compute the NBS; 
(2) designing distributed procedures to converge towards the 
NBS. The scope of this paper is about (1) and (2) but with 
the restriction that distribution is only performed in terms 
of decision and not in terms of channel state information. 
Instead of determining the maximum of tt, we propose to 
find the minimum of — tt, denoted after as TTm, by focusing 
on the conjugate gradient algorithm. One of the steps of this 
algorithm consists in determining a parameter denoted as 
Pk+i (which is defined next). Accordingly, many methods 
have been introduced such as: Fletcher-Reeves, Polak-Ribiere 
and Hestenes-Stiefel. Due to the efficiency of its convergence, 
we focus here on the second method based on calculating the 
Polak-Ribiere parameter lITOl . The spectrum sharing policy 
is updated in an alternating manner, just like the iterative 
sequential iterative water-filling algorithm ifTTI . However, in 
contrast with the latter, only the decision is distributed here 
and global channel state information is needed (through the 



Hessian matrix). 



Algorithm 1: Decentralized determination of the NBS 

(1) Set the position of the relay 

(2) a;° — (wj'jOj") (frequency initiahzation) 

(3) vo — — V7rm(a;°) (initiahzation gradient) 

(4) k=0; while ||t;fc|| > e 



a. ti 



(optimal parameter with Newton method 



where gk = \/7Tm{u!^), A„i is the Hessian matrix of tt^ and 
bj^ = (wj^jCj^) is the frequency bands at the k*^ iteration) 
tkVk (new frequency bands) 
'=+^(1) and tj2^^ = uj^{2) (alternated updates) 



c. uj 



k+l 



d. gu+i = V7r„(a;''+i) (new gradient) 



e. h+i = 

f- Wfc+i = -5fc+i 
k = k+l 
end 

(5) (o^f^^o.: 



.9fc+i(5fc+i - 9k) 



9l9k 



(Polak-Ribiere parameter) 



■ Pk+iVk (new descent direction) 



NBS , ,NBS\ _ 



) = (cj-^^(l),w^^(2)) where FI denotes 



Final Iteration 



D. Convergence of the algorithm ( Strict-concavity analysis of 
the TT function ) 

The proposed algorithm is ensured to converge to a NBS 
if TT is strictly concave. But this property is not always true. 
According to the previous study, the function tt is defined 
on the subregion TZ'^ which is formed by all the utilities 
(mi,U2) verifying Ui > uf^ for all i G {1,2}. Such a set 
can be determined when the NE point is fixed. Though, for 
each channels values, a NE can be identified. Consequently, 
the subregion TZ'^ depends on the channels values. In the 
following, for given locations of sources and destinations, we 
show that there exists a region in which the tt function is 
strictly concave. 

Proving the strict-concavity of tt amounts to proving the 
strict-negativity of the eigenvalues of its corresponding Hes- 
sian matrix, which is given by: 



an ai2 

021 ^22 



(13) 



where: 



an 



0.22 



ai2 



021 



-26(u2 - 
doj2 

-2b(ui - 

dujidu!2 
-b{u2 - 
{ipi - ipi 

diUldu!2 

ai2, 



with ipi = af{ji^i)/pi and = af{'^ff)/{pi Pr) for 
i e {1,2}. Therefore, the eigenvalues are the zeros of the 
following polynomial: 



,NE 



) - 26w2(-<(3i + V-'i - b{2uji + W2)): 



,NE 



P : - A tr(v4) +dct(A). 



(14) 



If we denote A the discriminant of the polynomial P, we can 
verify merely that A is always positive. Indeed, we have: 

A = (flu - 022)^ + 4ai2a2i. (15) 

Since 012 = 021, equation ( fTSI ) is equivalent to: 

A = (an -022)^ +4(ai2)2 > 0. (16) 

Therefore, the eigenvalues of A, denoted as Ai and A2 are real 
and are as follows: 



Ai = 



A. 



tr(^) 



tr(^) + VA 



(17) 



From these expressions, we study the the strict negativity of 
the eigenvalues depending on the relay position in a space 
region [0, 700] x [0, 700] m^. The corresponding simulations 
(for the same settings considered in section II V) are given in 
Fig. [3] in which we represent in black the region where the 
eigenvalues are strictly negative. We assume a standard choice 
for / for all the numerical results provided in this paper which 
is f{x) = (1— e^^/^)*^ |2 | where AI is the number of symbols 
per packet. Therefore, we can deduce that the strict concavity 
of function tt is ensured in a region {xr,yr) S [400,550] x 
[400, 550] m2. 




) — 2buji{—ip2 + V'2 — b{2uj2 + t^i)). 
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Figure 3: Strict-concavity of the tt function on the disk TZ'^ 
when both eigenvalues are strictly negative. 

IV. Numerical results 



,NE 



) - b{ui 



,NE 



Here, we implement the NBS and compare it to the NE 
|[T]. We consider a scenario where the coordinates (in me- 
) + b'^u!iUJ2+ te'") of each source/destination nodes Si/Di are as follows: 

+ 6(2wi + L02)){ip2 - V^2 + b{2LU2 + ^i)), ^i(300, 300), i:»i(500, 645), ^2(390, 257) and D2(590, 603). 

The channel gains are given by 0.097/d"' where d is 

the distance between the transmitter and the receiver. The 
noise power and transmission powers of the users and relay 



are 10~^^ Watt, 0.1 Watt and 0.08 Watt respectively, and a 
is set to 0.8 bit/s per Hz. The constants b and M are set to 
lO^'^ and 80 respectively, while the bandwidth uj is fixed to 1 
MHz. 
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Figure 4: Gains in terms of individual bandwidth for user 1 
when operating at the NBS instead of the NE. 
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Figure 5: Gains in terms of system/total bandwidth by 
operating at the NBS instead of the NE. 

Our results highlight that the NBS requires less bandwidth 
than the NE. Additionally, the energy-efficiency (with pricing) 
at the NBS is higher than the one at the NE. In Fig. S] 
we represent the relative bandwidth gain (NBS vs NE) in 
% of user 1 w.rt. the coordinates of the relay (the relative 
gain of user 2 shows a similar behavior). Simulations show 
that maximum gains are obtained when € [400, 550] m. 
Moreover, for different y^, there are some regions of Xr where 
the gains vanish. In these regions, the optimum bandwidth 
with NBS is equal to that with NE. Since user 1 cannot profit 
from the presence of the relay (when this latter is far from 

= 0. However, user 2 



NE 



,NBS 



the source), we have uj{ 
maximizes its utility at the NE (when the relay is much closer 
to its location) and we have = uj2^^ ^ 0. This shows 
that our analysis provides some insights to an operator who 
would like to optimize the location of a relay. 




:r-eoordiiiate of the relay (r,.) [m] 



Figure 6: Gains in terms of sum energy-efficiency with 
pricing (social welfare) by operating at the NBS instead of 
the NE. 



In Fig. |5] we plot the gain in terms of total bandwidth 
demand. Thus, we deduce that a maximum gain with NBS is 
reached when the relay is positioned at {xr,yr) G [400, 550] x 
[400, 550] m-. In this region, the total bandwidth demand is 
reduced to 20 — 25%. The study of the social welfare in Fig. 
|6] confirms that the maximum energy-efficiency (with pricing) 
gain, which is around 10— 12%, is reached in the same region. 
Consequently, the results obtained according to the strict- 
concavity analysis are well confirmed when implementing the 
conjugate gradient algorithm. 

V. Conclusion 

This paper studies an efficient solution for a relevant game 
introduced in [1 1 by referring to the NBS. Remarkably, up to a 
time-sharing argument, the corresponding spectrum allocation 
game can be checked to possess all the properties to have a 
unique NBS. Through implementing a conjugate gradient al- 
gorithm in a decentralized way, considerable gains of 20—25% 
can be obtained in terms of used bandwidth. The results 
reached for the two-user case are very encouraging to extend 
the case study to larger multi-user systems. Interestingly, 
our analysis gives some insights into how to deploy some 
relays for improving a distributed network both from a spectral 
and energy standpoint. This paper is a first step towards 
designing fully distributed algorithms (in terms of channel 
state information) or learning techniques which converge to ef- 
ficient solutions such as the NBS or Raiffa-Kalai-Smorodinsky 
solution ; this task is known to be challenging and this paper 
shows the existence of relevant wireless scenarios where this 
objective might be reachable. 
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